Abstract

I show a simple expression of the Mill’s ratio of the Student’s t-distribution. I
use it to prove Conjecture 1 in Peter Auer, Nicolo Cesa-Bianchi, and Paul Fischer.
Finite-time analysis of the multiarmed bandit problem. Mach. Learn., 47(2-3):
235-256, May 2002.

We first need the following technical lemma. 

Lemma 1. For all x > 0 we have 
Proof. Denote by y = , with y > 1. We have
Hence we have
where in the last inequality we used $\exp(z) - 1 \ge z$.


□ 


The following theorem provides simple upper bounds on the Mill’s ratio of a
Student’s t-distribution.


Theorem 1. Let $f_\nu(x)$ be the pdf of a Student’s t-distribution with $\nu$ degrees of freedom.
Then, for any $\nu > 0$, we have
Proof. The first stated inequality holds for $a = 0$ by the symmetry of the Student’s t
distribution, hence we can safely assume $a \neq 0$. We have that


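$P[X \ge a] = \int_a^\infty f_\nu(x)\,dx = C_\nu \int_a^\infty \left(1 + \frac{x^2}{\nu}\right)^{-\frac{\nu+1}{2}} dx$,
where $C_\nu = \frac{\Gamma\left(\frac{\nu+1}{2}\right)}{\sqrt{\nu\pi}\,\Gamma\left(\frac{\nu}{2}\right)}$.
With the change of variable $z = \nu \log\left(1 + \frac{x^2}{\nu}\right)$, we have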
where in the inequality we used Lemma 1. We now use the facts that
$\int_a^\infty e^{-x^2/2}\,dx \le \frac{1}{a} e^{-a^2/2}$ for $a > 0$ and
$\int_a^\infty e^{-x^2/2}\,dx \le \sqrt{\frac{\pi}{2}}\, e^{-a^2/2}$ for $a \ge 0$ to have the stated bounds. □


The following corollary is a slightly better version of Conjecture 1 in Auer et al.
[2002]. I could not find a proof of it in any paper, so I decided to give a simple proof
for it.


Corollary 1. Let X be a Student’s t random variable with $\nu$ degrees of freedom. Then,
for $0 \le a \le \sqrt{2(\nu + 1.22)}$ and $\nu > 0$, we have

$P[X \ge a] \le e^{-a^2/4}$.
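
The corollary can be spot-checked numerically. The sketch below is not part of the proof. It assumes the statement reads P[X ≥ a] ≤ exp(-a²/4) for 0 ≤ a ≤ sqrt(2(ν + 1.22)), and it evaluates the tail by composite Simpson integration of the standard Student's t density. The helper names `t_pdf`, `t_tail`, and `corollary_holds` are hypothetical, introduced only for this illustration.

```python
import math


def t_pdf(x, nu):
    """Density of a Student's t-distribution with nu degrees of freedom."""
    log_c = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
             - 0.5 * math.log(nu * math.pi))
    return math.exp(log_c - (nu + 1) / 2 * math.log1p(x * x / nu))


def t_tail(a, nu, n=2000):
    """P[X >= a] for a >= 0, using symmetry and Simpson's rule on [0, a].

    n must be even. The subtraction from 1/2 loses accuracy when the tail
    is extremely small, so this is only meant for moderate nu and a.
    """
    if a == 0:
        return 0.5
    h = a / n
    s = t_pdf(0.0, nu) + t_pdf(a, nu)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * t_pdf(i * h, nu)
    return 0.5 - s * h / 3


def corollary_holds(nu, points=50):
    """Check P[X >= a] <= exp(-a^2 / 4) on a grid of a in [0, sqrt(2(nu + 1.22))]."""
    a_max = math.sqrt(2 * (nu + 1.22))
    grid = [a_max * k / points for k in range(points + 1)]
    return all(t_tail(a, nu) <= math.exp(-a * a / 4) for a in grid)
```

Calling `corollary_holds(nu)` for a few moderate values of `nu` gives a quick sanity check of the stated range. A grid check of this kind can only fail to find a counterexample; it does not replace the proof.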

Proof. First observe that the constant factor in the first bound of Theorem 1 is at most
$K = 0.543$, as can be verified numerically. Using the first result of Theorem 1 we have
where in the last inequality we used the fact that $\log(x+1) \ge \frac{2x}{2+x}$ for $x \ge 0$. Hence the statement
of the corollary is equivalent to finding the upper bound on $a^2$ such that
that in turn is equivalent to
Solving the quadratic equation we have


$a^2 \le \nu - 2\log(K) + \sqrt{\nu^2 + 4\log^2(K) - 12\nu\log(K)}$. (3)

Hence, the condition $a^2 \le 2\nu - 4\log(K)$ satisfies the inequality above. Using the
value of $K$ we have the stated bound. □
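
The algebra leading to (3) and to the final condition can be sketched as follows. The sketch assumes that the quantity compared with $e^{-a^2/4}$ has the form $K \exp\left(-\frac{\nu a^2}{2\nu + a^2}\right)$. This form is an assumption, inferred from the coefficients appearing in (3), and is not a quotation of the intermediate displays.

```latex
\begin{align*}
K \exp\!\left(-\frac{\nu a^2}{2\nu + a^2}\right) \le \exp\!\left(-\frac{a^2}{4}\right)
  &\iff \log K - \frac{\nu a^2}{2\nu + a^2} + \frac{a^2}{4} \le 0 \\
  &\iff a^4 - \bigl(2\nu - 4\log K\bigr)\, a^2 + 8\nu \log K \le 0 .
\end{align*}
% Quadratic in a^2; since \log K < 0 the product of the roots is negative,
% so the inequality holds for a^2 up to the larger root:
\[
a^2 \le \nu - 2\log K + \sqrt{\nu^2 + 4\log^2 K - 12\nu\log K} .
\]
% Because \log K < 0,
\[
\nu^2 + 4\log^2 K - 12\nu\log K \;\ge\; \nu^2 + 4\log^2 K - 4\nu\log K = (\nu - 2\log K)^2 ,
\]
% so the square root is at least \nu - 2\log K, and a sufficient condition is
\[
a^2 \le 2\nu - 4\log K \approx 2\nu + 2.44 , \qquad K = 0.543 ,
\]
% which covers the range a^2 \le 2(\nu + 1.22).
```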